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Abstract. For any densely defined, lower semi-continuous trace r on a C*— algebra 
A with mutually commuting C* — subalgebras A\, A2, ■ ■ ■ A n , and a convex function / 
of n variables, we give a short proof of the fact that the function (x\, X2, ■ ■ • , X n ) — > 
r(/(xi, X2, ■ ■ • , x n )) is convex on the space (Bi=i(^i)sa- If furthermore the function / 
is log-convex or root-convex, so is the corresponding trace function. We also introduce 
a generalization of log-convexity and root-convexity called £— convexity, show how it 
applies to traces, and give some examples. In particular we show that the Kadison- 
Fuglede determinant is concave and that the trace of an operator mean is always 
dominated by the corresponding mean of the trace values. 
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1. Introduction. The fact that several important concepts in operator theory, 



in quantum statistical mechanics (the entropy, the relative entropy, Gibbs free 
energy), in engineering and in economics involve the trace of a function of a self- 
adjoint operator has motivated a considerable amount of abstract research about 
such functions in the last half century. An important subset of questions involve 
the convexity of trace functions with respect to their argument. 

The convexity of the function x — > Tr(/(x)), when / is a convex function of 
one variable and a; is a self-adjoint operator, was known to von Neumann, cf. [12, 
V.3. p. 390]. An early proof for f(x) = exp(x) can be found, e.g. in [19, 2.5.2]. A 
proof given by the first author some time ago describes the trace Tr(/(x)), where 
/ is convex, as a supremum over all possible choices of orthonormal bases of the 
Hilbert space of the sum of the values of / at the diagonal elements of the matrix 
for x. This proof was communicated to B. Simon, who used the method to give 
an alternative proof of the second Berezin-Lieb inequality in [21, Theorem 2.4], see 
also [22, Lemma II. 10.4]. Simon only considers the exponential function, but the 
argument is valid for any convex function, cf. [10, Proposition 3.1]. The general 
case for an arbitrary trace on a von Neumann algebra was established by D. Petz 
in [17, Theorem 4] using the theory of spectral dominance (spectral scale). 

The basic fact, for one variable x and a positive convex function /, is that 

]T. /((</>„ *</>,))< !*(/(*)), (1) 
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where the sum - finite or not - is over any orthonormal basis. Equality is obviously 
achieved if the basis is the set of eigenvectors of x. Thus, 

Tr(/(x)) = sup^./((^,x^)). (2) 

Essentially, the proof is the following: If {(pj} is the eigenvector basis (with eigen- 
values Xj) and ipj is some other basis, then ipj = J2kCjk4>k, and the coefficients 
of the unitary matrix C satisfy V- |Cjfc| 2 = 1 = J2k \Cjk\ • Then (ipj,x ifjj) = 
Efc \ c jk\ 2 X k and, by Jensen's inequality, f(J2k \ C jk\ 2 ^k) < Efc \ c jk\ 2 f(X k ). Now, 
summing on j we obtain (1). 

Equation (2) implies the convexity of x — » Tr(/(x)), because any supremum 
of convex functions is convex. Moreover, if f(x) = exp(x) then one sees that 
Tr(exp(a;)) is log-convex (i.e. log( Tr(exp(x))) is convex), because an ordinary 
sum of the type ^exp(aj), with a,j in R, is log-convex. Similarly, if f(x) = \x\ p 
(with p > 1) we see that x — > ( Tr(|a;| p )) is convex. In particular, the Schatten 
p— norms are subadditive. 

Even more is true. If f(x) = exp(g(x)) and g is convex, then x —* Tr(f(x)) is 
log-convex. Similarly, if f(x) = \g(x)\ p then x — > ( Tr(/(x))) is convex. 

A natural question that arises at this point is this: Are there other pairs of 
functions e, £ of one real variable, beside the pairs exp, log and |t| p , |£| 1 ' p , for 
which x — ► £ ( Tr(/(a;))) is convex whenever / is £— convex, i.e. f(x) = e (g(x)) and 
g is convex? In the second part of our paper we answer this question completely 
and give a few examples, which we believe to have some potential value. But first 
we turn to the question of generalizing (2) to functions of several variables. 

We start with a function /(A) of n real variables (with A = (Ai, A2, . . . , A n )). 
Next we replace the real variables Xj, by operators Xj, similar to the one- variable 
case. An immediate problem that arises is how to define f(x) in this case. The 
spectral theorem, which was used in the one- variable case, fails here unless the Xj's 
commute with each other. Therefore, we restrict the x±, X2, ■ • • , x n to lie in com- 
muting subalgebras A\,Ai, . . . ,A n , and then f(x) and Tr(/(a;)) are well defined 
and it makes sense to discuss the joint convexity of this trace function under the 
condition that / is a jointly convex function of its arguments. (We do not investi- 
gate the question whether f(x) is operator convex - only the convexity under the 
trace.) 

More generally, we replace the trace Tr in a Hilbert space setting by r, a densely 
defined, lower semi-continuous trace on a C*— algebra A; i.e. a functional defined 
on the set A + of positive elements with values in [0, 00], such that r(x*x) = r(xx*) 
for all x in A. We further assume that A comes equipped with mutually commuting 
C*— subalgebras Ai, A2, . . . , A n . 

The convexity of the function x — > r(f(x)) on the space of n— tuples in 
©i=i(^i)sa w as proved by F. Hansen for matrix algebras in [5]. (Here A SSb denotes 
the self-adjoint elements in A.) His result was extended to general operator algebras 
by the second author in [16] . Both these proofs rely on Frechet differentiability and 
some rather intricate manipulations with first and second order differentials. 

We realized that the argument in (2) will work in this multi-variable case as 
well, thereby providing a quick proof of the convexity. The key observation is that 
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basis (in the case that A is a matrix algebra) that simultaneously makes all the Xj 
diagonal. 

For a general C*— algebra (e.g. the algebra of continuous functions on an in- 
terval) we have to find something to take the place of an orthonormal basis. This 
something is just the commutative C* — subalgebra generated by the n commuting 
elements x\, X2, • • • , x n . It depends on x, of course, but that fact is immaterial for 
computing the trace for a given x. 

The main result in the first part of this paper - proved in section 6 - is 

2. Theorem ( Mult i variable Convex Trace Functions). Let f be a continuous 
convex function defined on a cube I = IiX---xI n in MJ 1 . IfAi,..., A n are mutually 
commuting C*—subalgebras of a C* — algebra A and t is a finite trace on A, then 
the function 

(xi,...,x n ) -*T(f(x 1 ,...,x n )), (3) 

defined on commuting n— tuples such that Xi E (Ai) 1 ^ for each i, is convex on 
©(^4i)sa- (Here, (^i)sa denotes the set of self- adjoint elements in A^ whose spectra 
are contained in 1^.) 

If t is only densely defined, but lower semi- continuous, the result still holds if 
f > 0, even though the function may now attain infinite values. 

In the second part we explore the natural generalization of the concept of log- 
convexity mentioned before and explained in detail in section 7. We find a necessary 
and sufficient condition on a concave function £ that ensures that £— convexity of a 
function / = e o g, with e = £~ x and g convex, implies £— convexity of the function 
x — > r(f(x)) for a tracial state r on a C*— algebra A. (A tracial state is a trace 
satisfying r(l) = 1.) The main result there - proved in section 9 - is 

3. Theorem {£— Convex Trace Functions). Let f be a continuous function 
defined on a cube I — I\ x • • • x I n in M, n , and assume furthermore that f is 
£— convex relative to a pair of functions e,£ as described in section 7, where £'/£" 
is convex. If A±, . . . , A n are mutually commuting C* — subalgebras of a C* — algebra 
A and t is a tracial state on A, then the function 

(xi,...,x„) — >r(/(xi,...,x„)), (4) 

defined on commuting n— tuples such that Xi G {Aj) 1 ^ for each i, is also £— convex 
on ®(^4i)sa- tf moreover £' /£" is homogeneous and / > the result holds for any 
densely defined, lower semi- continuous trace t on A. 

In sections 4, 5 and 7 we set up some necessary machinery, whereas the key 
lemmas are in sections 6 and 8. 

The third part of the paper, sections 9-22, consists of examples where we apply 
the preceeding results. In particular we investigate the n-fold harmonic mean of 
positive operators and show how its trace behaves under certain concave transfor- 
mations. As a corollary we prove in Proposition 23 that the trace of any mean (in 
the sense of Kubo and Ando [8]) is dominated by the corresponding mean of the 
trace values. 

Throughout the paper we have chosen a C*— algebraic setting with densely de- 
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as well have developed the theory for von Neumann algebras with normal, semi- 
finite traces; in fact we need this more special setting in the proof of Lemma 
6. However, the Gelfand-Naimark- Segal construction effortlessly transforms the 
C* — algebra version into the von Neumann algebra setting, so there is no real dif- 
ference between the two approaches. 

4. Spectral Theory. We consider a C*— algebra A of operators on some Hilbert 
space ft and mutually commuting C*— subalgebras Ai, • • • , A n , i.e. Ai C Aj for all 
% 7^ j. For each interval Ii we let (AA^ denote the convex set of self-adjoint elements 
in Ai with spectra contained in Ii. If / = I\ x • • • x I n C M, n and / is a continuous 
function on / we can for each x = (x±, • • • , x n ) in ©(^4j)ga define an element f(x) 
in A. To see this, let Xi = f XdEi(X) be the spectral resolution of Xi for 1 < i < n. 
Since the x^s commute, so do their spectral measures. We can therefore define the 
product spectral measure E on / by E(S\ x • • • x S n ) = Ei(Si) ■ ■ ■ E n (S n ) and then 
write 

f(x)= f f(X 1 ,---,X n )dE(X 1 ,... ,X n ). (5) 



Of course, if / is a polynomial in the variables Ai, . . . , A n we simply find f(x) by 
replacing each A^ with Xi. The map / — ► f(x) so obtained is a *— homomorphism 
of C(I_) into A and generalizes the ordinary spectral mapping theory for a single 
(self-adjoint) operator. In particular, the support of the map (the smallest closed 
set S such that f(x) = for any function / that vanishes off S) may be regarded 
as the "joint spectrum" of the elements x±, . . . , x n . 

This theory applies readily in the situation where A = A\ <g> • • ■ <g> A n , but, 
curiously enough, the tensor product structure (used extensively in [5] and [16]) is 
not needed in our arguments. 

5. Conditional Expectations. Let r be a fixed, densely defined, lower semi- 
continuous trace on A, and let C be a fixed commutative C* — subalgebra of A. 
By Gelfand theory we know that each commutative C* —subalgebra of A has the 
form Cq(T) for some locally compact Hausdorff space T. Note now that if y G C+, 
the positive part of C, and has compact support as a function on T, then y = yz 
for some z in C+. Since the minimal dense ideal K(A) of A is generated (as a 
hereditary *— subalgebra) by elements a in A + such that a = ab for some b in A+, 
cf. [14, 5.6.1], and since r is densely defined, hence finite on K(A), it follows that 
r(y) < oo. Restricting r to C we therefore obtain a unique Radon measure fie on 
T such that 

f y(t) dfi c (t) = r(y) , yeC, (6) 



cf. [15, Chapter 6]. Furthermore, if x E A + the positive functional y — > r(yx) on 
C determines a unique Radon measure on T (by the Riesz representation theorem) 
which is absolutely continuous with respect to fie-, in fact dominated by a multiple 
of nc (by the Cauchy-Schwarz inequality). By the Radon-Nikodym theorem, there 
is a positive function <&(x) in L™ C (T) such that 

/ ' y(t)$(x)(t) dnc(t) = r(yx) , yeC. (7) 

Extending by linearity, this defines a map $ from A to L^ c (T) which is linear, pos- 
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<fr(y) — y almost everywhere if y G C (with respect to the natural homomorphism 
of C = C Q (T) into L~(T)). 

When t is faithful and C and A are von Neumann algebras, the map $ is a 
classical example of a conditional expectation, cf. [7, Exercise 8.7.28]. 

6. Lemma. With notations as in sections 4 and 5, take x = (x\, . . . ,x n ) in 
©i=i(^-i)sa an d l e t f be a continuous, real and convex function on 1 = I\ x • • • I n . 
If t is unbounded we assume, moreover, that f is positive. For each commutative 
C*—subalgebra C of A set 

<fic(f,x) = J f($(x 1 )(t),...,<f>(x n )(t))dLL C (t). (8) 

Then <fic(f,x) < T (f(x))> with equality whenever Xi G C for all i. 

Proof. If Xi G C for all i then also f(x) G C, and since <&(xi) = x\ almost everywhere 
we get by (6) that 

<Pc(f,x) = J f{xi(t), . ..,x n (t)) d^ c {t) = J f(x)(t) dfi C (t) = r(f(x)) . (9) 

To prove the inequality for a general n— tuple we first assume that r is a normal, 
semi-finite trace on a von Neumann algebra M, [7, 8.5] or [20, 2.5.1], and put 

M T = {xeM I r(\x\) < oo}. (10) 

We then assume that A = M^r, the norm closure of M T , so that A is a two-sided 
ideal in M. We further assume that each Ai is relatively weakly closed in A, i.e. 
A{ = A fl Mi for some von Neumann subalgebra Mi of M. This has the effect that 
every self-adjoint element Xi in Ai can be approximated in norm by an element yi in 
Ai with finite spectrum and sp(j/j) C sp(xi). Under these assumptions we notice 
that since both / and $ are norm continuous it suffices to establish the inequality 
for the norm dense set of n— tuples x = (xi, . . . ,x n ), such that each Xi has finite 
spectrum. Since the x^s commute mutually there is in this case a finite family {pk} 
of pairwise orthogonal projections in A with sum 1 such that 



./, ; 



^A^pfc, 1 < i < n, (11) 



where Xik G Ii (and repetitions may occur). If we set X k = (Aifc, • • • , X n k) this 
means that 

f(x) = J2f(^)Pk- (12) 

As Y.Pk = 1 in A also J2 ®(Pk) = 1 in L™ C (T), so that ^ $(p*)(t) = 1 for (almost) 
every t in T. Since / is convex this implies that 

/($(x 1 )(t),...,$(x n )(t))=/(^A lfc $(p fc )(t),...,^A nfc $(p fc )(t)) 



6 ELLIOTT H. LIEB & GERT K. PEDERSEN 

Consequently, by (8), (13) and (6), 

= £ /U*)t(p*) = t(J2 f(h)Pk) = r(f(x)) . 



(14) 



To prove the inequality for a general C*— algebra A with commuting C*— sub- 
algebras Ai consider the GNS representation (7r r ,i3 T ) associated with r. By con- 
struction, cf. [14, 5.1.5], we obtain a normal, semi-finite trace r on the von Neumann 
algebra M = tv t (A)" such that r(7r T (x)) = r(x) for every x in A + , and we can de- 
fine the two-sided ideal M T and its norm closure M^r as in (10). If we now put 
A = M= and Ai = A n (7r T (Aj)) _w (closure in the weak operator topology) for 
1 < i < n, we have exactly the setup above. Our argument, therefore, shows that 
the inequality holds in the setting of A, Ai and r, and an arbitrary commutative 
C*-subalgebra C of A. 

Note now that since r is densely defined on A we have 7r T (A) C A. If C is a 
commutative C* — subalgebra of A we put C = n T (C), and observe that C has the 
form C = Cq(T) for some closed subset T of T with nc(T \T) =0. The map $ 
from A to L°° (T) defined in section 3 will therefore satisfy the restriction formula 

5(tt t (x)) = $(x) | f (15) 

for every a; in A. But then, by our previous result in (14), 

Vc(f,x) = / /($(xi),...,$(a; n )) rf^ c = / /($(7r T (xi)),...,$(7r T (x n ))) rf^ c 
= ^(/' ^r(x)) < T(f(Vr(x)) = r(^Af(x)) = r(/(x)) . (16) 

□ 



Proof of Theorem 2. It is evident that the function x — * </?c (f,x), defined in Lemma 

6, is convex on ©(^4i)sa f° r eac h C, being composed of a linear operator $(= $c), 

a convex function /, and a positive linear functional - the integral. Moreover, if 

C denotes the set of commutative C*— subalgebras of A, it follows from Lemma 6 

that 

r(/(x)) = sup ipc(f,x) , (17) 

c 

the supremum being attained at every C that contains the commutative C*— sub- 
algebra C*(x) generated by the (mutually commuting) elements x\, . . . , x n . Thus 
x — > t(/(x)) is convex as a supremum of convex functions. D 

This concludes the first part of our paper and we now turn to Theorem 3. 

7. £— Convexity. We consider a strictly increasing, convex and continuous func- 
tion e on some interval /, and denote by £ its inverse function (so that £(e(s))) = s 
for every s in I and e(£(t)) = t for every t in e( J)) . If g is a convex function defined 
on a convex subset of a linear space then / = e o g is convex as well, but in some 
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say in this situation that the function / is £— convex. This terminology is chosen to 
agree with the seminal example, where e = exp and £ = log. 

Our aim is to show - under mild restrictions on £ - that £— convexity has some 
remarkable structural properties, being preserved under integrals and traces. By 
contrast, the concept of £— concavity - with the obvious definition - seems to be 
less interesting. 

8. Lemma. If I is an interval in M. and e is a strictly increasing, strictly convex 
function in C 2 (I) with inverse function £, then for each probability measure [i on 
a locally compact Hausdorff space T , and for each £— convex function f defined on 
some cube I in M. n , the function 

(ui, ...,«„)-> J f (ui(t), . . . , «„(*)) dfi(t) , m G L™(T) , (18) 

is also £— convex on the appropriate n— tuples in L°f(T) if and only if the function 
ip, defined by <p(e(s)) = (e"(s)) (e'(s)) , is concave. 

Proof. Since / = e o g for some convex function g on I, we see that to prove the 
Lemma it suffices to show that the increasing function 

*(„)=/(/,(„(«))«*)), « e£ ~(T), (19) 

is convex on (L£° (T)) 1 . Clearly, this is also a necessary condition. Considering 
instead the scalar funtions 

h(s) = £ ( I e (u(t) + sv(t)) dn(t)\ (20) 

for arbitrary elements u, v in L°f(T), where the range of u is contained in the 
interior of /, we notice that convexity of k is equivalent to convexity at zero for all 
functions of the form h, and we therefore only have to show that h"(0) > 0. Setting 
r(s) — J e(u + sv)dfj, we compute 

h'(s) = £'(r) e'(u + sv)v d/j, ; 

ff V f (21) 

h"(s) = £"{r) / e'(u + sv)v d/i + £\r) / e"(u + sv)v 2 d\x . 



On the other hand, since £(e(t)) = t we also have 

£'(e(t))e'(t) = l- 

£"(e(t))(e'(t)) 2 + £'(e(t))e"(t)=0. 

In our case we can let e(t) = r(s), whence t = £(r(s)) = h(s). We can therefore 
eliminate £"{r) in (21) to get the expression 

2 
-2/ / „//„. i „„.\„.J..\ i t)l(„\ I „"{„. i „„A„.2 



h"(s) = -£'(r)e"(h(s))(e'(h(s)))- 2 I / e' (u + sv)v dfi j +£\r) / e"(u + sv)v 2 dfi 
= £'(r) I [ e"(u + sv)v 2 d/j-e"(h(s))(e'(h(s)))- 2 ( [ e' (u + sv)v dfi) ) . (23) 
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Since e is strictly increasing, so is £, which implies that £'(r) > 0. It follows from 
(23) that h"(0) > if and only if 



e"(/i(0))e'(/i(0))- 2 / e'{u)vdn) < / e"(u)v 2 d\i . (24) 



Now define the function <p on e(J) by <p(e(s)) = (e / (s)) 2 (e // (s)) _1 . Since e(/i(0)) = 
r(0) = f e(u) dfi and e"(h(0)) > because e is strictly convex we see that (24) is 
equivalent to the inequality 

e'(u)vdfj,\ < <p I / e(u) dji I e"(u)v 2 d(j>. (25) 



For ease of notation put </? (/ e(u) dfi) = <p(e(u)), and choose a function w in LST) 
with J wdfj, = 1. Then consider the quadratic form 



A 2 / e"(u)v 2 djji — 2A / e'(u)vd/j, + <p(e(u)) w dji 

' J J ( 26 ) 

(AVe"(w) - 2Ave / (w) + ^(e(w))w) d/i . 



By construction this form is positive if and only if (25) is satisfied. But (26) 
expresses the integral of a function which is itself a quadratic form. The minimum 
in (26) therefore occurs for Xve"(u) = e'(u) and equals 

(v(e(u))w-(e"(u))- 1 (e'(u)f)d tt 

is \ t (27) 

(<p(e(u))w — (p{e{u))) djj, — ip [ I e{u) dji] — j (p{e{u)) d\x . 



Evidently this expression is non-negative if and only if ip is concave. □ 

9. Remark. Using the equation e(£(t)) = t as in (22) we easily find that </?(£) = 
— (£"(t)) _1 £'(t), so that the condition in Lemma 8 translates to the demand: 

The (negative) function t — > £'(t)/£"(t) must be convex. (28) 

Note also that the condition in (27), viz. 

(f(e(u)) dfi < <p I e(u) djjA (29) 

makes sense for an arbitrary measure \i and provides a necessary and sufficient 
condition for the function defined in (18) to be £— convex. However, in order to 
satisfy (29) for an arbitrary (point) measure, the function ip must have sip(t) < ip(st) 
for all s > 0, which forces it to be homogeneous, i.e. p(st) = s<p(t) for s > 0. It 
follows that the function defined by (18) in Lemma 8 is £— convex for an arbitrary 
measure \i if and only if 

£'(t)/£"(t) = jt for some non-zero number 7. (30) 

Of course, this can only happen when the domain of £ is stable under multiplication 
with positive numbers, so it is either a half-axis or the full line. But since the 
expression in (30) must be negative, only half-axes can occur. 

Proof of Theorem 3. The Theorem follows by using Lemma 6, as in the proof of 
Theorem 2, combined with Lemma 8. □ 
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10. Examples. Evidently the condition that £' /£" be convex is not very restric- 
tive, and is satisfied by myriads of functions, of which we shall list a few, below. On 
the other hand, the demand that £' /£" be homogeneous is quite severe, and only 
four (classes of) functions will meet this requirement: 

(i) Let £{t) = log(t) for t > 0. We get e(s) = exp(t) for s in R and £'/£" = -t. 

This is the classical example, and by far the most important. Clearly £(t) = clog(t) 

for any c > can also be used, but we omit this trivial parameter here and in the 

following examples. 

(ii) Let £{t) = t l ' p for t > and some p > 1. We get e(s) = t p for s > 

and £'/£" = —"ft, where 7 = p/(p — 1) > 1. The root examples are also fairly well 

known. Indeed, it is a very general fact that whenever / is a convex (resp. concave) 

function that is homogeneous of some degree p > then (i) we must have p > 1 

(resp. p < 1) and (ii) the function f x ' p is automatically convex (resp. concave). 

This is discussed in detail in the proof of Corollary 1.2 in [9]. 

(iii) Let £{t) = -t~ a for t > and some a > 0. We get e(s) = (-s)" 1 /" for s < 

and £'/£" = --ft, where 7 = (1 + a) -1 < 1. 

(iv) Let £{t) = -(-t) p for t < and some p > 1. We get e(s) = -(-s) 1 /* for s < 

and £'/£" = -ft, where 7 = (p - l)" 1 > 0. 

Non-homogeneous examples are not hard to come by. Without any apparent 
order we mention these: 

(v) Let £{t) = — exp(— at) for t in R and some a > 0. We get e(s) = —a -1 log(— s) 
for s < and £' /£" = —1/a. In applications of Theorem 3 the parameter a 
disappears, since £(r(e(a))) = — exp(— ar(— a -1 log(— a))) = — exp(r(log(— a))) for 
any operator a < 0; so we may as well assume that a — 1. 

(vi) Let £{t) = log(log(t)) for t > 1. We get e(s) = exp(exp(s)) for s in R and 
£'{t)/£"{t) = — tlog(t)(l + log(t)) -1 , which is only convex for t < e. So on the 
intervals 1 < t < e and —00 < s < we can use the functions log log and expexp. 
(vii) Let £(t) = (\og(t)) 1 ' p for t > 1 and some p > 1. Here e(s) = exp(s p ) for s > 
and by computation we find that £' /£" is convex for t < exp(l + 1/p). The allowed 
interval for e is < s < (1 + p)p~ 2 . 

(viii) Let £{t) = (1 - (1 - t) p ) 1/p for < t < 1 and some p > 1. We get e(s) = 
1 - (1 - s^) 1 ^ for < s < 1 and £'{t)/£"{t) = (p - l)" 1 ((1 - t) p+1 - (1 - t)), 
which is a convex function on the unit interval. 

(ix) Let £{t) = t^ p (l + tVP)-i for t > and some p > 1. Here e(s) = s p (l - s)" p 
for < s < 1. For p > 1 we find after some computation that £'{t)/£"{t) = 
2pt 1+1 ' p ((p — l)(p — 1 + (p + l)t 1 ' p )\ plus a linear term, and this is a convex 
function. For p = 1 we simply get £'{t)/£"{t) = — 1(1 + t), and we note that this 
example is just a translation of (iii) (replacing t by 1 + 1 and adding 1). 

11. Corollary. If f is a positive, continuous, log— convex function on a cube £ in 
M. n , and A\, . . . , A n are mutually commuting C* — subalgebras of a C* — algebra A, 
then for each densely defined, lower semi- continuous trace t on A the function 

(xi,...,x n ) — > r{f{xx, . . . , x n )) , (31) 

defined on commuting n— tuples such that Xi E (Ai) 1 ^ for each i, is also log — convex 
on ®(^-i)sa- //, instead, f x l p is convex for some p > 1, or if —f~ a is convex for 

„„^,„ „ ^ r\ — . ;t t ^ n 7 / t\v a„ „ 4-U — „?„„ u „ A4-„. „f jt„ 
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respective functions 



i 



(xi, . . . , x n ) — »■ (r(/(xi, . . . , x n ))) 1/p 

(xi, . . . , x n ) — »■ -(t(/(xi, . . . , x n )))~ a , (32) 

(xi,...,x n ) — »■ -(-r(/(xi,...,x n ))) p . 

12. Remarks. The Corollary above applies to some unexpected situations. Thus 
we see that the function 

(x, y)^log(r(exp((x + y) 2 ))), (33) 

where x and y are self-adjoint elements in a pair of commuting C*— algebras A and 
-B, is (jointly) convex. The same can be said of the function 

(x, y)->log(T(exp(-xV))) (34) 

for < a, /3 and a + /3 < 1, defined on A + x B + . Applied to the root functions the 
Corollary shows that the function 

( x ,y)-,( T ((x + y y)) 1/p (35) 

is convex for 1 < p < q, and that also 

(xj)-.(r((l-xVf)) 1/p (36) 

is convex for p > 1 on the product of the positive unit balls of A and B. 

The last two cases in Corollary 11 are perhaps easier to apply in terms of concave 
functions. By elementary substitutions we find that if / is a positive concave 
function on some cube £ in M. n , then both functions 

^-(r((/(x))-))- 1/a ; 
£-(r((/(x)) 1/p )) P ; 

are concave on ®(^4i)sa for a > (and / > 0), and for p > 1. In particular we see 
that 

(r((x + y) 1/p ))) P > (r^))" + (rtf'*))" , (38) 

for all x, y in A + , so that the Schatten p— norms are super-additive for p < 1. 

Recall from [4] the definition of the Kadison - Fuglede determinant A associated 
with a tracial state r on a C* — algebra A: 

A(x) = exp(r(log |x|)) whenever x G A~ l . 

This is a positive, homogeneous and multiplicative map on the set of invertible 
elements, cf. [4, Theorem 1], closely related to the ordinary determinant from matrix 
theory. Note, however, that when A = M n (C) then A(x) = (det(|x|)) , because 
we have to use the normalized trace r = — Tr. 

n 

The following result is well known for matrices, at least for functions of one 
variable. 
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13. Proposition. For each strictly positive concave function f on a cube I in 
MJ 1 , and mutually commuting C* — subalgebras Ai, . . . ,A n of a C* — algebra A the 
operator function 

*->A(/(z)) (39) 

is concave on the appropriate n— tuples of commuting self-adjoint elements from 
®(^4i)sa- I n particular, the Kadison - Fuglede determinant is a concave map on 
the set of positive invertible elements. 

Proof. From example (v) in section 10 we see that the function 

x - - exp (r (log(-(-/(x))))) = -A(/(x)) (40) 

is convex, as desired. □ 

14. Remark. The concavity in (39) is closely related to one of the main results 
(Theorem 6) in [9] namely: x — > r (exp(z + log(x)) is concave for any self-adjoint 
operator z. In one sense (39) is stronger because it allows a general concave /, but 
when / is linear (39) is a corollary of Theorem 6 in [9], as we show now for one 
variable: 

We have to prove that 

exp (r (log(i(x + j/)))) > \ exp (r (log(x))) + \ exp (r (log(y))). 
Put z = — log {\{x + y)). Our condition is then that 1 > | exp (r (z + log(x))) + 
2 exp (r (z + log(j/))). However, r is a state and therefore Jensen's inequality ap- 
plies. Thus, exp (r (z + log(x))) < r (exp (z + log(x))), and similarly for y. By The- 
orem 6 of [9] we know that x — > r (exp (z + log(x))) is concave, which implies that 
ir(exp(^ + log(x))) + 2-r(exp(2 + log(y))) < r(exp(z + log(|(a; + y)))) = r(l) = 1, 
as desired. 

15. Operator Means. We recall from [8], see also [6, 4.1], that a Kubo-Ando 
mean on the set B + = B(f)) + of positive operators is a function a: B + x B + — ► B + 
such that 

(i) (lal) = l, 

(ii) (xi a y-i) < (x 2 a y 2 ) if x± < x 2 and y x < y 2 , 

(iii) a is (jointly) concave on B + x B + , 

(iv) z*{x a y)z = ((z*xz) a (z*yz)) for every invertible z in B(^). 

Of particular interest are the harmonic mean ! and the geometric mean # defined 
on positive, invertible operators by: 

xly = 2(x- 1 + y- 1 )- 1 = Mx + y)~ 1 y, 

X#y = £1/2^-1/2^-1/2)1/2^1/2 _ ( 41 ) 

(x # y was introduced in [18] and |(x ! y), the parallel sum, was introduced in 
[1]; see also [2], [6, 4.1] and [8].) The domain of definition for these means can be 
extended to all positive operators by a simple limit argument (replacing x and y 
with x + el and y + el, and taking the norm limit as e — > 0), and we shall tacitly 
use this procedure in the following. Thus we shall state all results for positive 
operators, but in the proofs assume that they are invertible as well. Note that 



-~™~"U1~~ ~~J 4-~U~4- 4-1 , 
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to the classical harmonic and geometric means when x and y are positive scalars, 
i.e. 

x ! y = 2xy(x + y)~ x and xj^y = ^fxy. (42) 



A straightforward application of the Cauchy-Schwarz inequality shows that 

r(x # y) < t(x) # r{y) (43) 

for any trace r. The corresponding result for the harmonic mean was proved for the 
ordinary trace Tr in [2]. The general version below (Proposition 21) is somewhat 
more involved, but also richer. For greater generality, but at little extra cost, we 
introduce the harmonic mean of an n— tuple of positive operators as 

{xi ! x 2 ! • • • ! x n ) = n(x^ + x? 1 H + x' 1 )' 1 , (44) 

and we note that this mean is symmetric in all the variables and increasing in each 
variable. The fact that it is also jointly concave may not be widely known, so we 
present a short proof. 

16. Proposition. The n—fold harmonic mean is a jointly concave function. 
Proof. As the expression in (44) is homogeneous all we have to show is that 

(E^T' + Gl^ 1 )" 1 ^ (Eto+K)- 1 )" 1 , (45) 

for any pair of n— tuples of positive invertible operators. Multiplying left and right 
by E^s + J/j) -1 we obtain the equivalent inequality 

(E(*. +*>-') {(E^y 1 + (£^D (»*+*)-') (46) 

<5^(^ + y l )" 1 - 

We now appeal to the fact that the operator function (x, y) — > y*x~ x y is jointly 
convex, [11], hence also jointly subadditive on the space of operators x, y, where x 
is positive and invertible. To see this, consider n— tuples (x^ and (yi) and define 
Zj = x- (yj — a), with a = (X^O Yl Vi- Then by computation we obtain the 
desired estimate 

o < E *; **■ = E ypJ'yi - (E vi) (E x >) _1 (E w) • ( 47 ) 

Breaking the left hand side of (46) into the sum of two terms and using (47) on 
each we obtain the larger operator 

y^(xi + y i )~ 1 x i (x i + yi)' 1 ) + y^ixi + y i )~ 1 y i (x i + yi)' 1 )) 
= ^2( Xi +yi) 1 (x i + y i )(x l + y i ) 1 = ^(xi + y l ) 1 , 

which is precisely the right hand side of (46), as claimed. □ 
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17. Remark. Note that (47) also shows that the n— fold harmonic mean is dom- 
inated by the arithmetic mean (the average). Indeed, 



xi ! x 2 ! ...lx n = | 


\n/, X k 




\ k=l 


n 


n 


sEi 1 ^ 1 )" 1 " 


■^J2 Xk - 



EiMlEK 1 ) (Ei 1 

(49) 



v fc=l / \fc = l / \fc=l 



fc = l fc = l 



This result is not surprising, since the harmonic mean (on pairs of operators) is the 
smallest symmetric mean, whereas the arithmetic mean is the largest, cf. [6, 4.1]. 

18. Proposition. Given positive, invertible operators x\,X2, ■ ■ ■ ,x n and y in 

M($)), let d = diag{xi,X2 ■ ■ ■ ,x n } and e = Yll i-iV ® e «j ^ n M n (B(^)). Then 
the following conditions are equivalent: 

(ii) ed x e < e , 
(iii) e < d . 

Proof. Assume first that y = 1, and set a = Y^k-i^ 1 ^ so that condition (i) 
becomes equivalent with a < 1. 
(i) -<=>- (ii). By computation 

n 

ed~ x e= 2_. «®ejj, (50) 

from which it follows that a < 1 if and only if ed~ 1 e < e. 

(i) =>- (iii). Let p = — e, and note that p is a projection in M n (B(5])). For every 

e > we have 

d" 1 = ( p+ (\- p ))d- l (p+(l-p)) < (1 + e)pd~ 1 p + (1 + e _1 )(l - p)d~ 1 (l - p) 
= (l + e)n~ 2 ed~ 1 e + (l + e~ 1 )(l-p)d~ 1 (l-p). (51) 

Now a < 1 by (1) and a fortiori <i _1 < 1. Moreover, (i) =>- (ii) and thus we get 

d' 1 < (1 + e)^" 2 e + (1 + e _1 )(l - p) = (1 + £)n _1 p + (1 + e _1 )(l - p) . (52) 

Taking inverses this means that 

d > (1 + £) _1 np + e(l + e) _1 (l - p) , (53) 

from which the desired inequality follows as e — ► 0. 

(iii) =>- (ii). If e < d, then e + el < d + el for every e > 0, whence 

(d + el)" 1 < (e + el)" 1 = (n + e)~ l p + £ _1 (1 - p) . (54) 

But then, since e(l — p) = 0, we have 
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and as e — *■ we obtain the desired estimate. 

In the general case, where y is arbitrary (positive and invertible), we define 

Xk = y~ 1 / 2 Xky~ 1 / 2 for 1 < k < n. Then with e = Ym j-i 1 ® e «j an< ^ ^ = 
diag{xi,X2, ■ ■ ■ ,x n } we observe that conditions (i), (ii) and (iii) become equivalent 
with the conditions 

(ii) ed~ 1 e < e , 
(iii) ~ e < d . 

But these conditions are exactly the ones we proved to be equivalent above, assum- 
ing that y = 1. □ 



19. Corollary. For any n— tuple (xi, x 2 , • • • , x n ) of positive operators in M(Sj) the 

harmonic mean [x\ ! X2 ! • • • ! x n ) is the largest positive operator of the form nz, 

such that 

(x x ... \ / z z ... z\ 



x 2 
V 







> 






7 



^ ^ 



V 



Z -2 



(56) 



D 



20. Remark. Note that Proposition 18 actually says slightly more than Corollary 
19, namely that the set of positive operators nz, such that z satisfies the matrix 
inequality in (56), is exactly equal to 



({xi ! x 2 I ...\x n ) 



+ 



nB(£)+. 



(57) 



This should be compared to the analogous result for the geometric mean of pairs 
of positive operators x and y. Here x # y is the largest positive operator z such 

that [ z y) > in M 2 (B(.f>)). However, one can find positive operators z such that 

z < x # y, without the matrix being positive. It suffices to take x — 1, so that 

x # y = y 1 / 2 . Evidently the matrix f z z \ is positive only if z 2 < y, and it is easy 

to find examples where z < y 1 ' 2 , but z 2 ^ y. 



21. Proposition. If r is a densely defined, lower semi- continuous trace on a 
C* — algebra A, and f is a strictly positive function on ]0, oof such that t — > (fit -1 )) 
is concave, then for all positive operators x\ 



. , x n G A and a > we have 



(r((f(x 1 \x 2 \ ...\ Xn )r)f^ 
< (r((/(xi)) Q )) 1/a ! (T(f((x 2 )Y)) 1/a \ •••! (r((/«)r)) 1/a • 



(58) 



Proof. Define g(t) = (f(t )) . Then from (37) we see that the function 



. t ' ( t „t„\\— a\\ 



-1/c 



nxn\ 
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is concave on the set of positive invertible elements in A. In particular, 

-l/a 



{r{(g(l/n(x^ + --- + x- 1 )))-)y 
> 1/n (r {(g(x^))-«)y 1/a + ■ ■ ■ + 1/n (r (G/fc 1 ))-"))" 170 

Taking the reciprocal values and noting that 1/n (x± +■ ■ ■+x~ 1 ) = (x\ ! • 
this means that 



(60) 



IxJ" 1 



(r((g(( Xl \ ■■■\x n )- 1 ))-«)) 1/a 
< n ((rMz?))-«))- 1/a + ■■■+ (tMz?))-"))- 1 '")' 1 (61) 

= (r((^ r 1 ))- a )) 1/a !---! (TMz?))-°)) 1/a . 
Since (7(t _1 ) _1 = /(£) this is the desired result. □ 

22. Remark. The result above applies to the functions t — ► t 1 ' p for p > 1, in 
particular to the identity function (and with a = 1); but it also applies to the 
functions t — >■ log(l + t 1 ' p ) for p > 1. 

On the abstract level Proposition 21 applies to C 2 — functions /, such that / and 
t — > t 2 f{t)~ 2 f'{t) are simultaneously increasing or decreasing (strictly) on ]0, oo[. 

Proposition 21 also applies to other (not necessarily symmetric) means of positive 
operators. Such means were introduced in [8] (see also [6, 4.1]). 

23. Proposition. For every Kubo - Ando mean a and for every densely defined, 
lower semi- continuous trace t on a C* — algebra A we have r(x a y) < r(x) a r(y) 
for all x, y in A + . 

Proof. For each mean a there is a unique probability measure \x on [0, oo] such that 



oo />oo 



xay=\ {tx ! y) (1 + 1/t) dfi{t) = \ I {(1 + t)x ! (1 + l/t)y) dfi(t) , (62) 



'0 ^0 



cf. [6, 4.1]. Applying Proposition 22 with fit) = t and a = 1 it follows that 

/oo 
r(te ! y) (1 + 1/t) dfi(t) 

,00 (63) 

< | / (tr(x) ! r(y)) (1 + 1/t) dfi(t) = r{x) o r(y) . 
Jo 

D 
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